Mandoku – An Incubator for Premodern Chinese Texts – or How to Get the Text We Want: An Inquiry into the Ideal Workflow
نویسنده
چکیده
Premodern Chinese texts pose problems that are difficult to accommodate with the current TEI text model, which bases the main hierarchy of a text on its structural content, rather than on a hierarchy that models the pages, lines and character positions. For the TEI, this is a sensible decision and has led to the abolishment of elements like and in the latest release of the Guidelines. For premodern Chinese texts however, especially texts that are transmitted as manuscripts or woodblock printings and have not yet seen a modern edition printed with movable type (let alone as, more recently, computerized typesetting), establishing the structural hierarchy of the text content is, together with the even more daunting question of establishing the proper characters of the text (on which see below), an important part of the research question that motivates the digitization of the text. Requiring an answer to this question before a proper electronic text can be created makes this intractable in the digital medium and glosses over an important leap of faith in the creation of a TEI encoded text. In this paper, I will try to trace some of the implications and propose an approach that allows different models of the text for different stages in the encoding process, thus closer modeling the process of the creation of an electronic text.
منابع مشابه
Inquiry into Scientific Correctness of the Elementary School Farsi Textbooks
Inquiry into Scientific Correctness of the Elementary School Farsi Textbooks H. Ghamari, Ph.D.* Scientific validity or correctness is of paramount importance when it comes to the contents of textbooks used in elementary schools, as the very foundations of learners’ knowledge is constructed therein. Hence, an analysis of these contents from this perspective is necessary in order to guide...
متن کاملThe Compilation of Urbanism Texts by Using the Iranian's-Valuable Texts (With Emphasis on the Islamic Ethics)
It is clear that each community should be have the specific urbanism science. Science localization is an obvious matter. This matter has motivated Iranian researchers, in urbanism field, to naturalize urbanism science having been imported to Iran. One method for producing or indigenizing urbanism texts in Iran, especially in recent years, is Utilization of Iranian-valuable texts. There are high...
متن کاملA Critical Inquiry into the Jurisprudential Foundation of Article 665 of the Islamic Penal Code (A Critique of the Theory of the Blood Money of the Left Testicle being more Than the Right One Based on Religious Texts and the Medicine)
Many Imamiyeh Jurists have considered the blood money for the testicles as an exception to the principle of pair and single organs, and hence, have considered a more blood money for the left testicle than the right one. Article 665 of the Islamic Penal Code, following this group of jurists, has adopted the same procedure. This assumption, one the one hand, is the result of the reference to some...
متن کاملImproving Precision of Keywords Extracted From Persian Text Using Word2Vec Algorithm
Keywords can present the main concepts of the text without human intervention according to the model. Keywords are important vocabulary words that describe the text and play a very important role in accurate and fast understanding of the content. The purpose of extracting keywords is to identify the subject of the text and the main content of the text in the shortest time. Keyword extraction pl...
متن کاملSyntactic Complexity of Russian Unified State Exam Texts in English: A Study on Reliability and Validity
In this study we analyze texts used in Russian Unified State Exam on English language. Texts that formed small research corpora were retrieved from 2 resources: official USE database as a reference point, and popular website used by pupils for USE training “Neznaika” (https://neznaika.pro/). The size of two corpora is balanced: USE has 11934 tokens and “Neznaika” - 11918 tokens. We share Biber’...
متن کامل